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55 as a relational database, an object database, or an XML document. The related items display names can be formatted and ranked 
according to the properties of the item, so as to avoid duplication and information overload for the user. The ranked data can be 
displayed along curved segments in a graphical chart, so as to allow convenient visualization of the data and the data's relationship 
to the principal item that may be prominently displayed at tbe center of the chart. The visualization of data sets containing a large 
fs| number of items from data sets is enabled, while respecting the size constraints of the display and maintaining an optimal level of 
<^ user comprehension. The visualization has the property that subsets of items in a data set relate to each other, and the relationships 
between items have a value associated with each other. Local rankings of the relationships between terms are established, by ranking 
fH items that relate to another. Next, a visualization is generated in the form of a graphical chart, by presenting results separately for 
t each item in a predetermined data set and adjusting the presentation to avoid information overlap and overload. Separate charts for 
*2 each item of the data set can be presented. Further, closeness is expressed by proximity of items along a curved segment, which are 
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^5 overload, the items related to a particular item are grouped by strength of affinity. Each related item is individually spaced on the 
^ affinity chart, with each item being placed in a non-overlapping position. Items with large numbers of related items are presented 
with multiple affinity charts. In the case of multiple affinity charts, a first affinity chart visualizes a set of most strongly related items. 
Next or subsequent related affinity charts visualize less strongly related items. Color and shading gradations and curve thickness 
gradations are selectively employed to emphasize the curve's role in conveying affinity strength. Items are placed so they do not 
overlap or crowd each other. Arbitrarily large data sets are visualized using low and localized computational resources. 
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Background 

Field of the Invention 

The present invention relates to visualization systems and methods and more 
particularly to systems and methods for enabling visualization of data sets containing 
1 5 large numbers of objects. 

Description of the Related Art 

U.S. Patent No. 5,966,1 39, granted to Anupam et al. and assigned to Lucent 
Technologies Inc., relates to scalable data segmentation and visualization. In particular, 

20 the ' 1 39 patent describes visualization of relatively large amounts of data in a limited 
display space, including segmentation of data, mapping of segments to a node within a 
display space based upon a specified nodal layout, texture mapping each graphical 
display to its node, and displaying the data at each node. The visualization scheme 
enables a user to map a relation to a specification of an n-dimensional display, by 

25 designating how attributes are to be used. 

However, the visualization system of the 1 1 39 patent fails to represent data sets 
having two properties, namely that subsets of items in a data set relate to each other, 
and the relationships between items have an affinity value associated with each other. 
Furthermore, a system according to the * 139 patent does not simplify the display of data 

30 to the user by presenting multiple screens of only the amount of data that can be 
comfortably fit on a screen. 



WO 01/35271 



PCT/US00/29819 



Summary of the Invention 
The present invention discloses methods and systems for the visualization of 
data sets containing a large number of interrelated objects (or items) from data sets 
having two properties, namely that subsets of items in a data set relate to each other, 
5 and the relationships between items have a value associated with each other. In 
particular, according to an embodiment of the present invention, a compact, easily 
understood, and easily navigable visual representation of objects in a data set can be 
achieved. 

As a preparatory step, local rankings of the relationships between items are 

10 established, by ranking the items i that relate to each item j, and ranking all items k to 
which item j relates, thereby ranking the affinity to each item j of item sets i and k. 
Next according to an embodiment of the present invention, a visualization can be 
generated, by presenting results separately for each item in a predetermined data set and 
adjusting the presentation to avoid information overlap and overload. Separate 

15 representation of each item of the data set can be accomplished by generating an affinity 
chart for each item j in the data set, to display items closely related to selected item j, 
■ with item j being placed prominently in the affinity chart, and placing items which are 
more strongly related to item j closer to item j. Further, closeness is expressed along 
curves or shaped segments which may be completely or partially straight, which are 

20 connected or which are emanating from item j's position. 

According to one embodiment of the present invention, continuous curves 
including but not limited to spiral segments, are employed to connect items relating to 
item j at different intensity levels. To adjust the visualization to avoid information 
overlap and overload, the items related to a particular item j are grouped by strength of 

25 affinity. Each related item is individually spaced on the affinity chart, with each item 
being placed in a non-overlapping position. Items with large numbers of related items 
are presented with multiple affinity charts. In the case of multiple affinity charts, a first 
affinity chart visualizes a set of most strongly related items. Next or subsequent related 
affinity charts visualize less strongly related items. According to an embodiment, 

30 curves can be used to represent the relationship of items related to a particular item 

positioned at a starting point for the curve. Distance along the affinity curve represents 
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the strength of the affinity to the item at the starting point of the curve. Color and 
shading gradations and curve thickness gradations are selectively employed to 
emphasize the curve's role in conveying affinity strength. Items are placed so they do 
not overlap or crowd each other. Arbitrarily large data sets are visualized using low and 
5 localized computational resources. 

Brief Description of the Dra wings 
Fig. 1 is an affinity chart according to an embodiment of the present invention. 
Fig. 2 is an affinity chart of other related items for the affinity chart of Fig. 1 . 
10 Fig. 3 is a flowchart illustrating one method of visualizing large interrelated data 

sets. 

Fig. 4 is a data flow diagram showing the flow of data through a system for 
visualizing large interrelated data sets. 

Fig. 5 depicts a system for visualizing large interrelated data sets. 
1 5 Fig. 6 illustrates a type of database structure that can be used as a source for the 

rankings used by a system visualizing large interrelated data sets. 

Fig. 7 is another affinity chart, according to the present invention. 

Fig. 8 is another affinity chart of other related items for the affinity chart of Fig. 

7. 

20 

Detailed Description 
Visualization of a data set refers to the use of various techniques to convey the 
overall structure of information by visual means. In particular, visual cues can be used 
to represent relations between objects. Visual cues can include, for example, using 
25 segments of a curve to represent the affinity (or strength) of the relationship between 
objects, and using gradations of the width and color of the curves to represent the 
intensity of the affinity relationships between the objects. 

Referring now to Fig. 1 , there is shown a diagram of most closely related items 
according to the present invention. In particular, Fig. 1 shows an affinity chart 129 
30 including first and second affinity curves 130 and 140 including a principal item 131 
and first and second pluralities of related items 1 32 and 142. Each related item 132, 
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142 includes a navigational link 133, 143 respectively and a search link 134, 144 
respectively. Adjacent to the principal item 131 and at one end of a selected strings of 
related items 132 and 142 are respective first sequence element 135 and 145, and 
adjacent to a last item of the selected strings of related items 1 32, 142 is a second 

5 sequence element 136, 146 which provides a link to a supplementary affinity chart for 
more remotely relevant strings of related items 132, 142. 

In an embodiment, the affinity chart 1 29 may consist of a single list of textual or 
graphical items and associated links. The principal item 13 1, related items 132 and 
142, first sequence elements 135, 145, and second sequence elements 136, 146 may all 

10 appear as items of the list. In this list, each related item may appear associated with a 
navigational link 133, 143, and a search link 1 34, 144. Such an affinity chart may be 
required when the display in which it is presented only accommodates lists. 

Referring now to Fig. 2, there is shown a diagram of other lesser related items to 
principal item 131 than those shown in Fig. 1 , according to the present invention. In 
15 particular, Fig. 2 shows an affinity chart 249 representing those items that would have 
been reached as a result of selecting first sequence element 1 35 in Fig. 1 . Affinity chart 

249 in Fig. 2 includes first and second affinity curves 250 and 260 including a principal 
item 131 and first and second pluralities of related items 252 and 262 that are lesser 
related to principal item 131 than related items 132 and 142 shown in Fig. 1. Each 

20 related item 252, 262 includes a navigational link 253, 263 respectively and a search 
link 254, 264 respectively. Adjacent to the principal item 13 l and at one end of a 
selected strings of related items 252 and 262 are respective first sequence elements 255 
and 265, which provide a link to the affinity chart of more strongly related items, and 
adjacent to a last item of the selected strings of related items 252, 262 is a second 

25 sequence element 256, 266 which provides a link to a supplementary affinity chart for 
more remotely relevant strings of related items 252, 262. By selecting the first 
sequence element 255 or 265, the user can navigate to reach the affinity curves 240 or 

250 to view a string of relevant items as shown in Fig. 1 . 

Fig. 3 is a flow chart depicting a method for visualization of large interrelated 
30 data sets, according to the present invention. According to this method, the 

visualization of data sets containing a large number of items from data sets having two 
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properties, namely that subsets of items in a data set relate to each other, and the 
relationships between items have a value associated with each other, is enabled. 

In an embodiment of the invention, a method of visualization of large 
interrelated data sets can include an information structuring phase 305, a chart layout 

5 phase 3 1 0, and an information linking phase 315. In information structuring phase 305, 
the relationships between objects in a data set and the intensity of those relationships 
can be computed. As a preparatory step in the information structuring phase, an item 
from the data set can be selected in step 320. Local rankings of the relationships 
between items can be established in step 324, by ranking for each selected item j the 

1 0 items i that relate to that item j, and then ranking all items k to which item j relates, 

thereby ranking the affinity for each item j to item sets i and k. As a result, the rankings 
of the related items are relative (or local) to item j, but are not a universal measurement 
of the importance of the item. To determine how an item relates to another item, the 
strength of the relationship between the items can be computed using any combination 

1 5 of objective or subjective criteria, or a combination of both. A value can be associated 
with each criterion, and a plurality of these values can be reduced to a single value (i.e. 
an affinity value), by, for example, adding them all together and then normalizing the 
value. 

Referring back to Fig. 1 , principal node 1 3 1 ("The Beatles") would have item 
20 set i and item set k related to it. Spiral curve 1 30 depicts item set i (i.e. those items that 
are related to principal node 131), while spiral curve 140 depicts item set k (i.e. those 
items to which principal node 1 3 1 relates). To determine both item set i and item set k 
in Fig. 1 , both objective and subjective criteria related to The Beatles could be used. 
Objective criteria used to determine the relationships between various musical bands 
25 could include, for example, the era in which the band played (e.g. 1 960s), and the genre 
of the music (e.g. rock, British Invasion, pop). Subjective criteria could include, for 
example, how well liked the band is based on feedback from users, and how often two 
bands appear together in radio station play lists. 

In an embodiment, a single affinity value can be determined from the various 
30 criteria to represent the strength of the relationship between objects, and the related 
items to the selected item can be ranked in step 324 by the affinity values associated 
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with each related item. Once ranked, those related items can be clustered in step 328. 
Clustering refers to the process of appropriately grouping the ranked objects. For 
example, an arbitrary number (erg. ten) of the most closely related items can be chosen : 
as a cluster. Once clustered, the number of affinity charts can be computed in step 332. 
5 Thus, using the above example, if twenty-eight related items exist, step 332 can result 
in a computation of three affinity charts needing to be generated, two of which would 
have ten items and the third haying eight items. The first ten would be the most closely 
related to the principal item, the next ten would be the next most closely related, and the 
last eight would be the next most closely related. 
10 Next in an embodiment of the invention, chart layout phase 3 1 0 can cause the 

organization and placement of the relevant subsets of the objects in charts or graphs that 
can be used to display the relationships between the objects. The complete set of charts 
constitutes a virtual map of the data set. 

Chart layout phase 31 0 can begin by positioning the selected item as a principal 
1 5 node in a chart in step 338. This can include the placement of a hyperlink for the 

selected item in a specified position of the chart. Next a visualization can be generated, 
by presenting results separately for each item in a predetermined data set and adjusting 
the presentation to avoid information overlap and overload. In an embodiment, 
separate presentation for each item of the data set can be accomplished by generating an 
20 affinity curve in step 342 for each item j in the data set, to display items closely related 
to selected item j, with item j being placed prominently in the affinity chart, and placing 
items which are more strongly related to j closer to j. Further, closeness can be 
expressed along curves or shaped segments which may be completely or partially 
straight, and which may be connected or may emanate from j's position, 
25 According to an embodiment of the present invention, continuous curves 

including but not limited to spiral segments, can be employed to connect items relating 
to j at different intensity levels. In step 346, a related item can be selected and in step 
350, the size required for that related item can be determined. In step 354, the related 
item can be individually spaced on the affinity chart by its rank, with each item being 
30 placed in a non-overlapping position by allowing sufficient vertical and horizontal 
displacement in step 358. A determination can be made in step 360 of whether any 
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more related items need to be placed on the affinity curve. If so, control returns to step 
346 where the next item is selected. If there are no more related items to process, the 
color and size gradients of the curve can be adjusted in step 362 to emphasize the 
affinity between the items. 

5 A selected item with a large number of related items can be presented with 

multiple affinity charts. In the case of multiple affinity charts, a first affinity chart can 
be used to visualize a set of most strongly related items. Next or subsequent related 
affinity charts can be used to visualize less strongly related items. 

According to an embodiment, arbitrarily large data sets can be visualized using 

10 low and local computational resources. During information linking phase 315, the 
sequence of affinity charts for a selected item can be hyperlinked in step 366. Each 
related item can then be linked to its own chart in step 370. Once the selected item and 
its associated affinity curves from the data set have been hyperlinked, navigation by the 
user can occur by the user clicking to connect to a selected related affinity chart. 

1 5 Further, as a result of information linking phase 315, each item may have separate 
features that can be activated by the user clicking on those features. 

Upon completion of information linking phase 3 1 5, a determination can be 
made in step 381 of whether more charts need to be generated for the selected item. If 
so, control can pass to step 338 where the selected item can be placed in a new chart. If 

20 no further charts are needed for selected items, a determination can be made in step 386 
of whether more items in the data set need charts generated for them. If so, control can 
pass to step 320 where a determination is made of a new selected item and the process 
repeats. 

Fig. 4 is a data flow diagram showing the flow of data through a system for 
25 visualizing interrelated data sets, according to an embodiment of the present invention. 
In the system shown in Fig. 4, a user 405 of a computer 41 2 containing a client browser 
410 (as is well known and understood in the art) can send a request 415 via a 
distributed computer network (such as the Internet) to a web server 420. Web server 
420 can process the request and ultimately provide the interface for user 405, including 
30 the ability to navigate visualized data sets. 
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Upon receiving the request, web server 420 can process the request and send the 
request on to chart server 425. A determination can then be made in step 430 of 
whether the particular chart corresponding to the request had previously been cached. 
This could have occurred if a prior request had been made for the same data. If so, the 

5 necessary information can be provided from chart cache 435. By way of example only, 
the information could consist of charts and hyperlinked data 440. 

If the particular chart corresponding to the request had not previously been 
cached, the request can be passed on to affinity server 445. Affinity server 445 can 
have available to it object affinity tables 450 that contain information regarding the 

1 0 items in the data set, including, for example, the fields detailed in the affinity tables 

shown in Fig. 6. After receiving a request, affinity server 445 can access object affinity 
tables 450 for the information needed to structure the chart corresponding to the 
request, according to information structuring phase 305 described with respect to Fig. 3. 

1 5 After being retrieved from chart cache 435 or generated by affinity server 445, 

the charts and hyperlinked data 440 can be passed to chart server 425 where the 
visualized data set can be graphically laid out and linked together. Upon completion, 
chart server 425 can pass the graphically laid .out and linked visualized data set to web 
server 420, which, in turn, can pass it on to client browser 4 1 0 being used by user 405. 

20 As a result, the visualized data set can be displayed to user 405. 

Fig. 5 depicts an embodiment of a system according to the present invention for 
visualizing large interrelated data sets. As shown in Fig. 5, web server 505 can receive 
image request 507 for a particular artist. Upon determining that image request 507 is to 
generate a visualization of data from a data set, that request can be passed to chart 

25 server 5 1 0. Chart server 5 1 0 can then determine whether the necessary information to 
fulfill the request (i.e. the affinity chart) has already been calculated and cached. If so, 
image map 509 can be retrieved by chart server 5 1 0 from cache 5 1 5 and returned to 
web server 505 for display to the user. The retrieval of the bit map can then cause 
bitmap request 511, which can then be followed by the retrieval of the associated 

30 bitmaps from cache 5 1 5 and the display of bitmap 5 1 3 to the user. 
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If the information needed to fulfill the request has not been cached, chart server 
510 can fulfill the request by retrieving information about the requested item from 
database 550. In particular, database 550 can contain data objects 555 that contain a 
distinct affinity value (A) between the requested item (Ol) and each related item(02). 

5 In response to a query by cache server 5 1 0, one or more data objects 555 for the 

requested item can be retrieved from database 550 and utilized by affinity server 520 to 
generate the necessary information (including, for example, image maps and bitmaps) 
for the selected item. Affinity server 520 can utilize an affinity ranking means 540 to 
rank the related items by their affinity values, and an image layout program 545 and 

1 0 image layout interpreter 535 to assemble the images. Affinity server 520 can also 

utilize image map generator 525 and bitmap renderer 530 to generate the actual image 
map and bitmaps. Once generated, the image map and bitmaps can be placed in cache 
515 and transferred to web server 505. Web sever 505 can then present to the user the 
visualization of the interrelated data associated with the requested item. 

1 5 Fig. 6 illustrates a type of database structure that can be utilized in the process 

of visualizing large interrelated data sets, according to an embodiment of the present 
invention. Within a particular data set, each item can be identified within a table 605 in 
a database by an object identifier (OID) 610 and an object name 615 (that may also 
contain object properties). The OID associated with each item is a primary key (PK) 

20 620. Two OlDs from table 605 serve as primary key 620 for the affinity strength table 
625. These two OlDs are labeled as foreign keys (e.g. FK1 and FK2 in field 640) of the 
second affinity strength table 625 within the database. Table 625 can be accessed by 
fixing the OID values to obtain the affinity value 635 between two items identified by 
their OIDs (e.g. OID #1 and OID #2 in field 630). 

25 In an alternative embodiment, the information about the related items within the 

data set can be stored as an XML document with a document type descriptor (DTD) 
rather than in database tables. One example of a DTD for such an XML document 
could be: 

<?xml version= M 1.0" encoding="UTF-8" standalone= M yes"?> 
30 <!DOCTYPE affinityChart [ 
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<!ELEMENT SIMILAR (1TEM,(L1ST)*)> 
<!ELEMENT LIST (ITEM)+> 

<!ATTL1ST LIST arm (upper| lower) "upper"> . r 

<!ELEMENTITEM(SEARCH,NAVIGATJON)> 
5 <!ATTLIST ITEM type (principal |related) "related"> 

<! ELEMENT SEARCH (#PCDATA)> 
<!ELEMENT NAVIGATION (#PCDATA)> 

Referring now to Fig. 7, there is shown a visualization of a data set of 
10; interrelated items, and further showing the most closely related items to a principal item 

711, according to an embodiment of the present invention. In particular, Fig. 7 shows 
an affinity chart 709 including an affinity curve 710 having a principal item 711 and a 
plurality of related items 7 1 2. Each related item 71 2 can includes a navigational link 
713 and a search link 714. Each navigational link 713 can permit a user to generate a 

15 new visualization of a data set of interrelated items related to its associated related item 

712. Each search link 714 can permit a user to produce a page that has search results or 
additional information about the selected item. Adjacent to the principal item 711 and 
at one end of a selected string of related items 712 is a first sequence element 715. and 
adjacent to a last item of the selected string of related items 712 is a second sequence 

20 element 7 1 6, each of which provide a link to a supplementary affinity chart for either 
more or less remotely relevant strings of related items 7 1 2. In Fig. 7, principal item 7 1 1 
is shown as tc union carbide productions". This item represents a musical group. 
Related musical groups having varying levels of affinity to "union carbide productions" 
are shown as well. For example, while the related item 712 representing the musical 

25 entity "Sarah McLachlan" has an affinity to principal item 711, other related items 
(such as related item 720 representing the musical entity "the Cure" and related item 
725 representing "R.E.M ") have a stronger affinity, as indicated by their closer 
proximity to principal item 7 1 1 on the curve. Similarly, still other related items may 
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V 



have a weaker affinity to principal item 71 1 , as indicated by their further proximity 
from principal item 71 1 on the curve. 

Referring now to Fig. 8, there is shown a diagram of other related items to the 
principal item in Fig. 7, according to the present invention. In particular, Fig. 8 shows 

5 an affinity chart 8 1 9 representing those items that would have been reached as a result 
of selecting first sequence element 715 in Fig. 7. In particular, Fig. 8 shows an affinity 
chart 819 including an affinity curve 820 including a principal item 71 1 and a plurality 
of related items 822. Each related item 822 includes a navigational link 823 and a 
search link 824. Adjacent to the principal item 711 and at one end of a selected string 

10 of related items 822 is a first sequence element 825, which provides a link to the 

affinity chart of more strongly related items, and adjacent to a last item of the selected 
string of related items 822 is a second sequence element 826 which provides a link to a 
supplementary affinity chart for more remotely relevant strings of related items 822. By 
selecting the first sequence element 826, the user can navigate to reach the affinity 

1 5 curve 71 0 to view a string of relevant items as shown in Fig. 7. 

The methods and apparatuses of the present invention provide a visualization 
technique that allow a set of related data items to be represented with respect to their 
relationship to a principal data item. The set of related data items may be in any of 
several forms and the display of the related items eliminates duplication and 

20 information overload for the user. The related items may be displayed in many ways, 
including along curved segments or in list form, so as to allow convenient visualization 
of the data and the data's relationship to the principal item. 

While the invention has been described in detail, including references to specific 
embodiments, it will be apparent to one skilled in the art that changes and modifications 

25 can be made to the invention without departing from the spirit and scope thereof. For 
example, while a particular embodiment related to musical artists has been disclosed, 
the invention can be applied equally as well for other data types. Thus, it is intended 
that the present invention cover the modifications and variations of this invention 
provided they come within the scope of the appended claims and their equivalents. 
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What is claimed is: 

1 . A method of information structuring in a data set containing a plurality of 
interrelated objects, comprising: 

5 ranking related objects based upon relationship strength; 

clustering related objects; and 
computing the number of affinity charts per object. 

2. A method, as in claim 1 , wherein said ranking step further comprises: 

10 for each related object to a selected object, calculating an affinity value between 

each of said related objects and said selected object based upon one or more criteria; 
and 

ordering each of said related objects in said data set according to said affinity 
value between said related object and said selected object. 

15 

3. A method, as in claim 2, wherein said one or more criteria includes an objective 
measurement. 

4. A method, as in claim 2, wherein said one or more criteria includes a subjective 
20 measurement. 

5. A method of generating a graphical layout, comprising: 
selecting a principal node for said graphical layout; 

generating at least one affinity chart in connection with said principal node; and 
25 sequentially establishing related items along the at least one affinity chart by 

rank. 

6. A method, as in claim 5, wherein said at least one affinity chart further 
comprises an affinity curve. 

30 

7. A method, as in claim 5, wherein said at least one affinity chart further 
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comprises a list of related items. 

8. The method according to claim 5 further comprising the step of positioning the 
selected principal node at a prominent location in said graphical layout. 

5 

9. The method according to claim 5 wherein for each related item in an affinity 
chart the size of the item is computed. 

10. The method according to claim 6 wherein gradients are used to suggest item 
10 affinity level. 

1 1 . The method according to claim 10, wherein said gradient further comprises a 
color gradient. 

15 12. The method according to claim 1 0 5 wherein said gradient further comprises a 
size gradient. 

13. The method according to claim 6 wherein provision is made to allow a 
sufficient vertical and horizontal displacement interval to prevent overlap of related 

20 items. 

14. A method of information linking, comprising: 

hyperlinking related graphs of visualizable large interrelated data sets; and 
navigating between hyperlinked related graphs. 

25 

1 5. A method for providing visualization of items from data sets, the method 
comprising: 

determining, for a plurality of items from said data set, a set of properties, said 
set of properties including a relationship to each other of the subsets of items in the data 
30 set, and a value applied to the relationships between the items; 
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applying local rankings of the relationships between terms, by ranking items i 
that relate to each item j, and ranking all items k to which item j relates, thereby ranking 
the affinity of each item j to item sets i and k; 

generating a visualization by presenting results separately for each item in a data 
5 set and adjusting the presentation to avoid information overlap and overload; and 

providing separate presentation for each item of the data set by generating an 
affinity chart for each item j in the data set, thereby displaying items closely related to 
selected item j, with item j placed prominently in the affinity chart, and placing items 
which are more strongly related to j closer to j. 

10 

1 6. The method of claim 1 5, further comprising the step of expressing closeness 
along shaped segments, emanating from j's position. 

1 7. A method as in claim 1 6, wherein said shaped segments further comprise curved 
15 segments. 

1 8. The method of claim 1 5, further comprising: 

employing continuous curves including spiral segments, in order to connect 
items relating to a item (j) at different intensity levels; 
20 adjusting the visualization to avoid information overlap and overload, the items 

related to item (j) grouped by strength of affinity; 

providing an affinity chart, and spacing each related item individually with each 
item placed in a non-overlapping position; 

presenting items with large numbers of related items with multiple affinity 
25 charts, and in the case of multiple affinity charts, providing a first affinity chart to 

visually represent a set of most strongly related items and providing next or subsequent 
related affinity charts to visually represent less strongly related items; 

using curves to represent a relationship of items related to a particular item 
positioned at a starting point for the curve, with distance along the curve representing a 
30 strength of an affinity to the item at the starting point of the curve; and 
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selectively employing color and shading gradations and curve thickness 
gradations are to emphasize the curve's role in conveying affinity strength, while 
placing items so they do not overlap or crowd each other. 

5 19. A method for providing visualization of data sets containing a large number of 
items from said data sets, the method comprising: 

employing continuous curves including spiral segments, in order to connect 
items relating to a primary item at different intensity levels; 

adjusting the visualization to avoid information overlap and overload, the items 
1 0 related to said primary item grouped by strength of affinity; 

providing an affinity chart, and spacing each related item individually with each 
item placed in a non-overlapping position; 

presenting items with large numbers of related items with multiple affinity 
charts, and in the case of multiple affinity charts, providing a first affinity chart to 
1 5 visually represent a set of most strongly related items and providing next or subsequent 
related affinity charts to visually represent less strongly related items; 

using curves to represent a relationship of items related to a particular item 
positioned at a starting point for the curve, with distance along the curve representing a 
strength of an affinity to the item at the starting point of the curve; and 
20 selectively employing color and shading gradations and curve thickness 

gradations are to emphasize the curve's role in conveying affinity strength, while 
placing items so they do not overlap or crowd each other. 

20. A method for providing visualization of arbitrarily large data sets using low and 
25 local computational resources, the method comprising: 

determining, for at least a plurality of said data sets, a set of properties, said set 
of properties including a relationship to each other of the subsets of items in the data 
set, and a value applied to the relationships between the items; 

determining at least one primary item for the visualization; 
30 applying local rankings of the relationships between terms, by ranking a first 

relational set of items that relate to the primary item, and ranking a second relational set 
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of items to which the primary item relates, thereby ranking the affinity to each primary 
item to the first relations set of items and to the second relational set of items; 

generating a visualization by presenting results separately for each item in a 
predetermined data set and adjusting the presentation to avoid information overlap and 
5 overload; 

providing separate presentation for each item of the data set by generating an 
affinity chart for each primary item in the data set, thereby displaying items closely 
related to a selected primary item, with the primary item placed prominently in the 
affinity chart, and placing items which are more strongly related to the primary item 
10 closer to the primary item; 

expressing closeness along curves or shaped segments, connected or emanating 
from the primary item's position; 

said expression of closeness including completely or partially straight shaped 
segments; 

15 . employing continuous curves including spiral segments, in order to connect 

items relating to a primary item at different intensity levels; 

adjusting the visualization to avoid information overlap and overload, the items 
related to the primary item grouped by strength of affinity; 

providing an affinity chart, and spacing each related item individually with each 
20 item placed in a non-overlapping position; 

presenting items with large numbers of related items with multiple affinity 
charts, and in the case of multiple affinity charts, providing a first affinity chart to 
visually represent a set of most strongly related items and providing next or subsequent 
related affinity charts to visually represent less strongly related items; 
25 using curves to represent a relationship of items related to a particular item 

positioned at a starting point for the curve, with distance along the curve representing a 
strength of an affinity to the item at the starting point of the curve; and 

selectively employing color and shading gradations and curve thickness 
. gradations are to emphasize the curve's role in conveying affinity strength, while 
30 placing items so they do not overlap or crowd each other. 
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21. A method for providing visualization of large interrelated data sets, the method 
comprising: 

determining a relationship strength of related items in a data set; 
for each item in the data set, ranking related items based on the relationship 
5 strength; 

clustering related items based on said ranking; 
computing a number of affinity charts per item; 
establishing clusters of related items; 

repeating the steps of ranking related items based on the relationship strength 
1 0 and computing the affinity charts until a desired information structure is achieved; 
positioning a principal node prominently in the affinity chart; and 
generating entries in said affinity chart emanating from the principal node for 
each of said clusters of related items. 

15 22. A method as in claim 2 1 , wherein said step of generating entries in said affinity 
chart further comprises laying out graphs of one per cluster of related items. 

23. A method as in claim 2 1 , wherein said step of generating entries in said affinity 
chart further comprises populating a list of related items. 

20 

24. A chart server for creating a graphical layout of a visualization of an interrelated 
data set, comprising: 

means for selecting a principal node for said graphical layout; 
means for generating at least one affinity curve from an affinity chart in 
25 connection with said principal node; 

means for sequentially establishing related items along the at least one affinity 
curve by rank; 

means for linking related affinity charts of interrelated data sets; and 
means for navigating between linked related affinity charts. 

30 

25. An affinity server for structuring information for use in visualization of a large 
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interrelated data set, comprising: 

means for ranking related objects based upon relationship strength; 

means for clustering related objects; and 

means for computing the number of affinity charts per object. 

5 

26. A computer readable medium containing computer program instructions for 
providing visualization of items from data sets, said computer program instructions 
containing instructions for: 

determining, for at least a plurality of said data sets, a set of properties, said set 
1 0 of properties including a relationship to each other of the subsets of items in the data 
set, and a value applied to the relationships between the items; 

applying local rankings of the relationships between terms, by ranking items i 
that relate to each item j, and ranking all items k to which item j relates, thereby ranking 
the affinity to each item j to item sets i and k; 
15 generating a visualization by presenting results separately for each item in a 

predetermined data set and adjusting the presentation to avoid information overlap and 
overload; and 

providing separate presentation for each item of the data set by generating an 
affinity chart for each item j in the data set, thereby displaying items closely related to 
20 selected item j, with item j placed prominently in the affinity chart, and placing items 
which are more strongly related to j closer to j . 

27. A system for providing visualization of items from data sets at a first computer 
operably coupled to a second computer over a communications network, comprising: 

25 a computerized server associated with said second computer, said computerized 

server including data set visualization software executable on said computerized server 
and configured to: 

determine, for a plurality of said data sets, a set of properties, said set of 
properties including a relationship to each other of the subsets of items in the data set, 
30 and a value applied to the relationships between the items; 
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apply local rankings of the relationships between terms, by ranking items i that 
relate to each item j, and ranking all items k to which item j relates, thereby ranking the 
affinity of each item j to item sets i and k; 

generate a visualization by presenting results separately for each item in a data 
5 set and adjusting the presentation to avoid information overlap and overload; and 

providing separate presentation for each item of the data set by generating an 
affinity chart for each item j in the data set, thereby displaying items closely related to 
selected item j, with item j placed prominently in the affinity chart, and placing items 
which are more strongly related to j closer to j. 

10 
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